WEB-PAGE PROCESSING METHOD FOR REDUCING LOAD OF 

SERVER 

FIELD OF THE INVENTION 

[0001] The present invention relates to a web-page processing method, 
and more particularly to a web-page processing method for reducing load of 
the associated server. 
BACKGROUND OF THE INVENTION 

[0002] Nowadays, Internet communication is prevailing and thus a 
variety of services and a number of commercial behaviors are conducted by 
means of the Internet technologies. Efforts have been being made to build a 
custom service platform on the Internet, which allows a large quantity of 
information to be exchanged thereon in a highly efficient manner. 
[0003] Referring to Fig. 1, a typical client-server networking 
architecture is shown. According to the client-server networking architecture, 
a client's side 1 1 and a server's side 12 are interconnected via Internet 13. At 
the client's side 11, via a terminal machine 1 1 1 such as a personal computer 
or a mobile phone and the Internet 13, a user 1 10 can exchange data with the 
server 121 at the server's side 12. 

[0004] Take a client-server networking architecture providing World 
Wide Web (WWW) service, where data are transmitted or exchanged 
between the client's side 1 1 and the server's side 12 in a Hyper Text Transfer 
Protocol (HTTP), for example. The server 121 has to process web pages 
stored in a database 122 and the terminal machine 111 should be installed 
with an interpreter, which is so-called as a browser 112 in order to interpret 
and execute the web pages. 

[0005] In the early stage of WWW service, the web pages were written 
in markup languages compatible with standard generalized markup language 
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(SGML), such as Hyper Text Markup language (HTML). Thus, such web 
pages could only be hyperlinked in an inactive form. With the increasing 
demand of interaction on Internet, active web pages become mainstream for 
web design works. In general, server-push means and client-pull means 
dominate the designing aim of active web pages, which will be described 
hereinafter. 

[0006] The server-push means is implemented by having the server 
actively control, refresh or change the contents of the web pages. The web 
pages formed in the server-push manner involve server-side commands, also 
referred as tags, and client-side commands. The client-side commands can be 
directly executed by way of the browser. The server-side commands, e.g. the 
commands constructed by Server Side Includes (SSI), Command Gateway 
Interface (CGI), Active Server Pages (ASP) or Perl Hypertext Processor 
(PHP) technology, on the other hand, need to be processed by the server so as 
to reveal the web pages. In contrast, the client-pull means is implemented by 
having the browser actively refresh the web pages. The web pages formed 
client-pull means involve only client commands. 

[0007] Referring to Fig. 2, an active web page 21 formed in the server- 
push manner is exemplified. In this example, the active web page 21 
contains client-side commands, i.e. HTML commands, and a server-side 
command I expressed as <I~#SSI CMD— >. The active web page 21 is 
converted into a specific entry of data in a common programmatic language 
such as C language before being stored into the database 122 (Fig. 1). The 
specific entry of data is stored in the database 122 in a byte-array form. 
Furthermore, the entry of data should be indexed from the filename of the 
web page 21 according to a file allocation table (FAT) utilized by the 
operating system of the server 121 (Fig. 1). The establishment of the file 
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allocation table is well known in the art and need not be further described in 
detail herein. 

[0008] Please refer again to Fig. 1. When the user 110 asserts a read 
request SI to the server's side 12 via the browser 1 12 to read the web page 21 
(Fig. 2), the server 121 will locate the data relevant to the web page 21 in the 
database 122 according to the filename information carried by the request SI 
and the file allocation table. Then, the server 121 asserts a read command S2 
to read the data having been located from the database 122. In response to a 
data output operation S3, the server 121 can obtain the web page 21. Before 
the web page 21 is transmitted to the client's side 11, a parsing operation of 
the web page 21 is performed by the server 121 to distinguish the SSI 
command I from the HTML commands. While the HTML commands are 
directly outputted to the browser 112 by a response operation S4, the SSI 
command I needs to be executed by the server 121 prior to response to the 
read request. For example, some server-inclusive information such as date, 
time or count of visitors, are inserted into the web page by executing the SSI 
command I first, and then outputted to the browser 112. 

[0009] The operation of the above web-page processing method is 
summarized in brief in the flowchart of Fig. 3. Web pages are transformed 
into a plurality of entries of data in a common programmatic language format 
(Step 1). The data are stored in a database and the access paths to the data in 
the database are recorded in the FAT (Step 2). In response to a read request 
from a browser at the client's side to read a specific web page (Step 3), the 
server asserts a read command to locate the address of a specific entry of data 
relevant to the requested web page according to the FAT. Then, the specific 
entry of data at the located address is outputted to the server (Step 4). In 
response to the receipt of the data from the database (Step 5), the server 
parses the tags of data (Step 6) to see if the currently received data is directed 

3 



to a server-side command (Step 7). If the data is a server-side command, it 
will be executed by the server and then the executing result is output to the 
browser (Step 8). On the other hand, if the currently received data is not a 
server- side command in Step 7, the received data contents will be directly 
outputted to the browser of the client's side (Step 10). Now back to Step 5, 
the end of the specific entry of data is determined when the flag representing 
the end of file (EOF) has been hoisted, and it means the processing of the 
requested web page has been completed. 

[0010] Since the parsing operation is basically a string-matching task, 
the processing procedure is time-consuming. For example, the speed of a 
server for processing a web page by omitting from performing the parsing 
operation can be up to several times of that of the same server for processing 
the same web page by performing the parsing operation. As the data amount 
of the web page increases, the load of the server is getting heavier. On the 
other hand, if the computation ability of the processor of the server is not 
strong enough, the service efficiency of the server would be significantly 
lowered. 

[0011] A conventional approach to the reduced load of the server upon 
processing a web page is to provide distinguishable filenames for the web 
pages requiring and not requiring the parsing operation, i.e. including and not 
including server-side command(s), respectively. For example, for the web 
pages containing no server-side command are ended with ".html" or ".htm" 
as usual. Differentially, the filenames of the web pages requiring the parsing 
operation are intentionally ended with ".phtml", ".phtm" or ".html-ssi". As 
such, only the web pages having the distinctive filename extensions 
like ".phtml", ".phtm" or ".html-ssi" need the parsing operation. As for the 
web pages having the common filename extensions like ".html" or ".htm", 
the parsing operation can be omitted so as to reduce the load of the server. In 
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spite this approach did enhance the efficiency of the server to some extent, it 
is not good enough particularly when the working efficiency of the processor 
of the server is not as high as required. 
SUMMARY OF THE INVENTION 

[0012] Therefore, the present invention provides a web-page processing 
method, in which the parsing operation performed by a server after receiving 
a read request from a browser at the client's side can be omitted, so as to 
reduce the load of the server. 

[0013] A first aspect of the present invention relates to a web-page 
processing method, comprises steps of: parsing web-page data to locate a first 
portion of contents associated with a server-side command; recording storage 
information of the first portion of contents in a database as a first index data; 
and outputting the first portion of contents to a server when the first index 
data is referred to in response to a read command of the server. 
[0014] Preferably, the parsing step further locates a second portion of 
contents associated with a client-side command, and the method further 
comprises steps of: recording storage information of the second portion of 
contents in the database as a second index data; and outputting the second 
portion of contents to the server when the second index data is referred to in 
response to the read command of the server. 

[0015] The first portion of contents is a first command block including 
one or more continuous server-side commands, and the second portion of 
contents is a second command block including one or more continuous client- 
side commands. 

[0016] Preferably, the first and second command blocks are 
immediately adjacent to each other. 

[0017] In an embodiment, the method further comprises steps of: 
executing the first command block by the sever, and outputting the executing 
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result to web-page request means; and outputting the second command block 
to the web-page request means without being executed by the server. The 
web-page request means, for example, is a browser of a client's side. 
[0018] The client-side command, for example, is written in a Hyper 
Text Markup language (HTML). 

[0019] In an embodiment, the first and second index data are stored in 
the database as an array or a linking list, and are referred to according to the 
filename of the processed web page and a file allocation table associated with 
an operating system of the server. 

[0020] Preferably, the first and second index data include one or more 
items selected from a group consisting of a command-identifying code, 
starting address of the storage position, end address of the storage position 
and total length of the first portion of contents. 

[0021] In an embodiment, the web-page data is converted from a web 
page written in a markup language compatible with a standard generalized 
markup language (SGML) and have a specified format of a common 
programmatic language. 

[0022] For example, the server-side command is constructed by Server 
Side Includes (SSI), Command Gateway Interface (CGI), Active Server 
Pages (ASP) or Perl Hypertext Processor (PHP) technology. 
[0023] According to a second aspect of the present invention, the web- 
page processing method comprises steps of: building an index file of a web 
page, wherein information of a first command block including one or more 
continuous server-side commands and a second command block including 
one or more continuous client-side commands are recorded; referring to the 
index file in response to a web-page read request; locating and outputting the 
first command block to a server according to information of the first 
command block recorded in the index file; and locating and outputting the 
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second command block to the server according to information of the second 
command block recorded in the index file. 

[0024] In an embodiment, the web-page processing method further 
comprises steps of: executing the first command block by the sever, and 
outputting the executing result to web-page request means; and outputting the 
second command block to the web-page request means without being 
executed by the server. 

[0025] Preferably, the index file is referred to in response to the web- 
page read request according to the filename of the web page and a file 
allocation table associated with an operating system of the server. 
[0026] A third aspect of the present invention relates to a web-page 
processing method, comprising steps of: referring to an index file which 
records therein respective storage information of server-side and client-side 
commands of a specific web page in a database in response to a read request 
from web-page requesting means; locating and outputting the server-side and 
the client-side commands to a server according to the storage information 
recorded in the index file, respectively; executing the server-side command 
by the server; and outputting the specific web page from the server to the 
web-page requesting means. 

[0027] Preferably, the specific web page is parsed to distinguish a first 
command block including one or more continuous server-side commands 
from a second command block including one or more continuous client-side 
commands, and storage information of the first and second command blocks 
in the database are recorded in the index file. 

[0028] In an embodiment, the specific web page outputted from the 
server to the web-page requesting means includes contents associated with 
the executing result of the server-side command and contents associated with 
the client-side command without being executed by the server. 

7 



[0029] The above objects and advantages of the present invention will 
become more readily apparent to those ordinarily skilled in the art after 
reviewing the following detailed description and accompanying drawings, in 
which: 

BRIEF DESCRIPTION OF THE DRAWINGS 

[0030] Fig. 1 is a schematic block diagram illustrating a typical client- 
server networking architecture; 

[0031] Fig. 2 is a schematic diagram illustrating an active web page 
containing client-side and server-side commands; 

[0032] Fig. 3 is a flowchart of a conventional web-page processing 
method; 

[0033] Fig. 4 is a schematic diagram illustrating the parsed result of the 
active web page of Fig. 2 according to an embodiment of the present 
invention; and 

[0034] Fig. 5 is a flowchart of an exemplified web-page processing 
method according to the present invention. 

DETAILED DESCRIPTION OF THE PREFERRED EMBODIMENT 
[0035] A web-page processing method according to a preferred 
embodiment of the present invention will be illustrated hereinafter. The load 
of the associated server can be reduced in the present method by eliminating 
the parsing operation conventionally performed after receiving a read request 
from a browser at the client's side. 

[0036] Generally, the web pages are converted into a plurality of 

entries of data in a common programmatic language format such as C 
language format, and then stored in the database 122 in a byte-array form. 
Each of the web pages may optionally include one or more server-side 
commands and one or more client-side commands. The server-side 
commands can be constructed by Server Side Includes (SSI), Command 
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Gateway Interface (CGI), Active Server Pages (ASP) or Perl Hypertext 
Processor (PHP) technology, and need to be processed by the server so as to 
reveal the web page. The client-side commands are typically written in a 
Hyper Text Markup language (HTML). According to the present invention, 
index files recording therein a plurality of index data relating to specified 
information of web pages are additionally built and stored in the database 
where the contents of the web pages are stored. 

[0037] In order to build the index files, each of the web pages is parsed 
to locate the portion associated with the server-side command(s) and the 
portion associated with the client-side command(s). The active web page as 
illustrated in Fig. 4 is used herein as an example for illustration. Referring to 
Fig. 4, the active web page 21 containing the client-side commands (i.e. 
HTML commands) and the server-side command I (e.g. expressed as <I— 
#SSI CMD->) is parsed to distinguish the server-side command from the 
client-side commands and locate three adjacent command blocks Bl, B2 and 
B3. The command block Bl consists of a plurality of continuous client-side 
commands starting from Bll and ending at B12. The command block B2 
consists of a single server-side command, i.e. the starting and ending 
positions B21 and B22 are the same. The command block B3 consists of a 
plurality of client-side commands starting from B31 and ending at B32. A 
variety of information associated with the command blocks, including the 
their storage addresses in the database, can be recorded as corresponding 
index data in the index file so that the command blocks can be extracted from 
the database when required by referring to the index file. The index data, for 
example, include command-identifying codes (e.g. for identifying the 
command blocks Bl, B2 and B3), starting addresses of the command blocks 
(e.g. corresponding to Bll, B21 and B31), end addresses of the command 
blocks (e.g. corresponding to B12, B22 and B32) and total length of the 
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command blocks. Since the storage positions of the command blocks in the 
database can be realized by referred to the index data, the data relevant to the 
requested web page can be outputted in response to the read request from the 
browser of the client's side without any real-time parsing operation. 
[0038] Preferably, the index data are stored in the database in an array 
or a linking list format, and are referred to according to the filename of the 
requested web page and the file allocation table (FAT) associated with an 
operating system of the server 121. 

[0039] When the user 110 asserts a read request SI to the server's side 
12 via the browser 1 12 to read a specific web page (with reference to Fig. 1), 
the server 121 asserts a read command S2 to locate the index file in the 
database 122 according to the filename information carried by the request SI 
and the file allocation table. According to the index data in the index file, the 
storage positions of the command blocks in the database 122 are realized. 
Then, the server 121 asserts another read command to read the data relevant 
to the requested web page (hereinafter referred to as web-page data) from the 
database 122. In response to the read command, the database 122 outputs the 
web-page data to the server in blocks by a data output operation S3. Herein, 
that the web-page data are outputted in blocks means that the data in the same 
commend block can be continuously outputted or preliminarily executed by 
the server because the web-page data have been previously parsed to isolate 
the server-side command block. For example, the client-side commands in 
the command block B 1 of the web-page data of Fig. 4 can be outputted from 
the server 121 to the browser 112 by a response operation S4 quickly and 
continuously. Once the server-side command block B2 is entered, the server- 
side command in the server-side command block B2 is automatically 
interpreted and executed by the server 121 first, and then the executing result 
is outputted from the server 121 to the browser 1 12 by a response operation. 
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After the command block B2 is processed and the client-side command block 
B3 is entered, the client-side command block B3 can be outputted from the 
server 121 to the browser 112 quickly and continuously again. By this way, 
the requested web page can be efficiently outputted to the browser 1 12 of the 
client's side 11. 

[0040] The embodiment of the web-page processing method described 
above is summarized in brief in the flowchart of Fig. 5. In the data write-in 
stage, web pages are converted into web-page data in a common 
programmatic language format (Step 1). Each of the web-page data, e.g. tags, 
is parsed to locate the server-side and client-side command blocks, 
respectively, and the information of the command blocks are recorded as 
index data in an index file. Of course, it is possible to perform the parsing 
operation only for the web-page data including therein server-side 
command(s). The access path to the index file is recorded in the file 
allocation table (FAT), and the web-page data and index file are stored in a 
database (Step 3). In response to a read request from a browser at the client's 
side to read a specific web page (Step 4), a read command is asserted by the 
server to locate the index data by referring to FAT. According to the index 
data, the storage position of the index file relevant to the requested web page 
in the database is realized. Therefore, the command blocks can be read out 
from the database by referring to the corresponding index data in the index 
file (Step 5). In response to the receipt of the index data from the database 
(Step 6), the command blocks are outputted from the database to the server 
accordingly. If what are currently received by the server are the contents of 
the web-page data associated with a client-side command block (Step 7), the 
commands in the same client-side command block are quickly and 
continuously outputted from the server to the browser (Step 9). On the other 
hand, if it is the contents associated with the server-side command block 
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received (Step 7), the server-side command will be executed first by the 
server, and then outputted to the browser (Step 8). When the flag 
representing the end of file (EOF) has been hoisted in Step 6, it means the 
processing of the requested web page has been completed. 
[0041] Since the parsing operation is previously performed in the data 
write-in stage other than run-time stage, the web-page processing method of 
the present invention is capable of reducing load of the server. 
[0042] While the invention has been described in terms of what is 
presently considered to be the most practical and preferred embodiments, it is 
to be understood that the invention needs not be limited to the disclosed 
embodiment. On the contrary, it is intended to cover various modifications 
and similar arrangements included within the spirit and scope of the 
appended claims which are to be accorded with the broadest interpretation so 
as to encompass all such modifications and similar structures. 
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